Artificial Intelligence Project— RLE and MIT Computation Center 
Symbol Manipulating Language— Memo 13— The Maling-Silver Read Program 

v J K. Maling 

Three typea of expressions can read 

(1) m - expressions 

(2) s - expressions 

(3) algebraic expressions. 

The program uses RDB, which means that single embedded 
blanks may be part of a print name; that a left parenthesis 
followed by, or a right parenthesis preceded by, any combi- 
nation of periods and commas Is treated as a special paren- 
thesis j and that any combination of + - ■ « * 1 is an "operation" 
group. > 

It is written in LIS? and when used with the apply opera- 
tor, the cards bearing the expressions must follow the STOP 
card. The card immediately following the STOP card must con- 
tain the special list of operation and punctuation characters, 
many of which can only be read in by the use of RDB. A func- 
tion DEFPRED uses this card to generate predicates for these 
characters. 

RDA and RDB have independent buffers so that they can be 
used in the same program without confusion. Betern does for 
RDB what INTERN does for RDA. A Rollo expression becomes a- 
vailable to the apply operator by executing the function 
AROLIST, which has no arguments. 

Unless an expression is preceded by ++ to indicate "QUOTE" 
or +* to indicate algebraic mode; 1*3 will be assumed to be an 
in-expression. ARGLIST expects one or several self-contained 
expressions separated by commas and terminating with any sym- 
bol or character other than a left parenthesis, comma, period 
or operation. The value of AROLIST is a list of translated 
expressions. 

The operation and punctuation characters consist of the 
following set 

*-** define 

— yields 

« equal 

+ or, plus 

* and, times 

/ of, ratio 

rot, mJ'-Ki.d 
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power 
/* lambda 
•H- quote 
~+ unquote 
+* algebraic raode 
-♦in - mode 

m - expression parenthesis, s - expression connective 



5) 



see below 



The program is not debugged to the point of my happiness, 
and the following items may warrant attention, 

(1) S - expressions: 

Add implied comma for these combinations 
) ( ) symbol group symbol group ( 
introduce special parentheses. 

(2) Algebraic expressions. 

The very simple function 'INFIX" does not make the 
proper association in certain special oases, namely 
tt+b«*c«d is read as (a+(b**(c*d))) 
instead of (a+((b**c)*<0) 

The more complex equivalent function for m - ex- 
pressions solves this difficulty and the method can be 
copied for the algebraic mode. 

If already in the algebraic mode, ** should be ig- 
nored. This is not so now. 

(3) M - expressions. 

If already in the m - mode, -* should be ignored. 
This is not so now. 

The period - parenthesis should not require a match- 
ing period if the latter is at the beginning or end of 
the expression. 

No use has been made of the fact that the value of 
RD is OPER Punct LPAR RPAR etc. This results in a slow- 
er program. 

There is no special character for PROG. 
Usage and Errors . 

The function ARQLIST has as its value a list of ex- 
pressions. That is why the examples given appear to have one 
redundant nesting of parentheses ground the equivalent s - 
expressions . Suppose we punch en a card 



. ^r-sr ■ - ^"^^f#^ 
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1*(u)(u~h-YES— Pr;i.nt/-H-OK,T— NIL),-h-YES $$ 
Then this card is placed after the operation characters card 
which follows the STOP card* 
Let the apply operator now execute 

(EVAL,(ARGHIST),()) 




Then the above function will be applied to its argument. 

The most common error is the failure to leave a space be- 
tween punctuation characters, or operation characters which are 
Intended to be distinct, 

e.g. U«++YES instead of U* -h-YES 

or (A— B, .C+D.— E) instead of (A--B, .C+D.— E) 

An expression or list of expressions separated by commas 
must have a termination symbol. $ ?$ causes the rest of the 
card uo be ignored. This is a feature of RDB. 

Oroe a grammatical error occurs the reading process usual- 
ly gets out of step; it may therefore be advantageous to in- 
sert a dummy arglist with this particular termination symbol be- 
tween expressions. This thought has not been debugged. 

Expressions included in any one list may be of any kind in 
any order, in - s - or algebraic, but care must be taken to re- 
store in - .node after an algebraic expression, if a non-alge- 
braic expression follows within the same arglist. Every arglist 
starts out in the in- mode anew. 
M-Expresslonc 

The notation in which in-express ions are written is de- 
signed to mininize the number of parentheses used. There are 
two characters that are interpreted as or imply parentheses, 
namely 'the slash and the period. 

CAR/L translates to ((CAR,L)) 
/READ translates to {(READ)) 
a*.b+c.'d translates to (y».M£»,A,(AND, (OR,B,C),D))) 

Thus functions of one or more arguments can be written 
without parentheses . For functions of more than one argument 
however, one writes 

LIST (A,B,C,D) translates to ((LIST,A,B,C,D)) 

Binary connectives have been given an order of dominance. 
In order of decreasing strength they are 

.«.._$«f «/ j DSP,- YIELDS, EtH'AL, OR, Ajn>^ t)V 
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Note that + and * are binary s and A«B«C*D is correctly 
( Tout not efficiently) translated. 

This order of dominance implies the existence of paren- 
theses . 

A w B/C * D— PE=*F+G will be rsad as if 
(A» £ ( (B/C)»D) -> (E«(F+G) ) ) ) 

A different method of parenthesizing can be imposed by in- 
serting periods as if they were parentheses » There is no ambi- 
guity since a period in front of a connective must be a closing 
parenthesis and a period after a connective must be an opening 
parenthesis. They must be matched. 

Periods are usually necessary tfhen NOT is intended, e.g. 
-.ATOM/LI. translates to ((N0T,(A?CM,L1))) 

Normal, parenthesis may enclose 
1) the arguments of a function 
2} the pairs of a conditional expression 
3) the variables of a Lambda expression 

and the terus are separated by comriss. Normal parentheses 
should never be used in any other uense. 
(P1-$E1,P2-*B2) translates to ((COND, (P1,E1, ), (P2,E2))) 
*(X,Y)Z(U,V) translates to (({LAMBDA, (X,Y),Z),U,V)) 
X(X,Y)(X-»Y) translates to ((LAM3))A, (X,Y),(COND, (X,Y)))) 

Nota that the triplet "x (variables )expression" can be 
Juxtaposed to a list of arguments just like the name of a 
function. 

If one is in the algebraic mode and the first symbol of 
the next expression is -*, then th<» remainder of the expression 
will be read as an in-expression. 
S-Expressio nB 

The "</uote" symbol is-++ and must be followed by a gram- 
matically oorrect S-expre3sion, including all commas. However, 
any number of trailing right parentheses can be made up by the 
"unquote" symbol which is -+ . If ;he number of right paren- 
theses is oorrect, no unquote symbol is required to restore the 
in-expression mode. Periods are permitted in the notation, so 
that ++(A.(B)) and -h-(A,B) are read identically. So also would 
be -h-(A.(B-+ of course. Every s -expression which has been read 
into the computer is represented by a list whose first element 
is the object QUOTE. A null expression results from ++ -+ . 
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Algebraic Expressions 
~ The operation characters are +*» «• l - m where - can be 
' used either as a unary or binary collective. 

The order of dominance Is * + /**• in decreasing strength- 
fhis impliad method of parenthesizing can be modified with con- 
ventional parentheses* 
A4B-C/D*n«e«F translates to ((BQOAL,(fM®,A f B)*(BatlO,C,(TIl®S t 

(powers,*) \ (mm** ) ) ) ) ) 

If the first character of an expression Is +• then this ex- 
preasion will be translated as algebraic, and so will sunae- 
quent ones until the mode is switched basic to ra-raode or the ar- 
glist meets a termination symbol. 



/"-> 
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APPENDIX 1-20-59 

The Convent ions on RD(B) 

1. Programming Conventions 

The instruction TSX RD,+ at L causes reading from cards or 
type accordingly as sense switch 1 is down or up respectively. 
One or more results will be read by the input equipment until 
the termination of a group as defined below has been reached. 
If the instruction is not the first read instruction in a pro- 
gram then it may not activate input equipment. This occurs if 
both beginning and end of a group lie within one recosfd that was 
read by a previous read instruction, 

RD(B) has two exits. The norma return is to L+2, but an 
end of file condition or a triple dollar sign causes return to 
L+l. RD(B) has two values when the normal return takes place. 
In the decrement of the accumulator is the machine name of the 
group type (one of seven possibilities). The group Itself is 
always stored in the same place, in a special list whose name 
is "VALVE". 

2. Typographical Conventions 

There are seven types of character groups 

(a) Symbols 

(b) Numbers 

(c) Operations 

(d) Punctuation 

(e) Left parentheses 

(f ) Right parentheses 

(g) Text 

A symbol group always begins wnth a literal (AtoZ) and 
contains any of 26 literals, 10 digits or any number of single, 
Isolated embedded blanks. Anything else terminates the group. 

A number group always begins with a digit and contains any 
of 26 literals, 10 digits or one embedded period. The embedded 
period must not be adjacent to a liberal. A blank may not be 
embedded. Anything else terminates the group. 

There are six operation characters, +•/--■. Hyphen 
and minus signs are treated without distinction. Any combi- 
nation of operation characters or one alone forms an operation 
group. Also any expression which does not contain dollar signs 
may become part of an operation grojp by enclosing it by dollar 
signs and writing it immediately after the last character of 



the operation group. The first enclosing dollar sign Is read In 
as a blank, the second is Ignored. 
V^ Any sequence of consecutive periods and/or commas is a 

punctuation group, provided it is not immediately preceded or 
followed by a left or right parentlwtsis respectively. 

A left parenthesis group Is a left parenthesis followed by 
any sequence of consecutive periods and/or commas. 

A right parenthesis group Is any sequence of consecutive 
periods and/or commas followed by a right parenthesis, providing 
this sequence is not preceded by a left parenthesis. If it is so 
preceded, the right parenthesis group consists of the right pa- 
renthesis alone. 

Text is any sequence of characters (other than dollar signs) 
enclosed by a pair of dollar signs., providing the leading dollar 
sign is not preceded by an operation character. Upon read-in, the 
first dollar sign is replaced by a blank, and the second is ig- 
nored- 

The number of characters in a group may be one to 120. The 
leading blank of text counts as one of the 120. 
/"-> A double dollar sign terminating text, or a triple dollar 
sign at any time causes exit to L+L A read instruction fol- 
lowing such a termination will start by reading new record. 

Blanks which precede or interpolate groups are supressed. 

A record contains 72 characters, whether on cards or tape. 
These characters are punched into columns 1-72 of cards. The 
72nd character of a record i3 followed without Interruption by 
the first character of the next reocrd. If the column punch- 
ing does not represent one of the i*6 legal characters then RD(B) 
goes to error. However, all combinations of legal characters 
are legal. 
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